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ABSTRACT 


The problem of designing optimal inputs in the identification of linear 
systems with unknown random parameters is considered using a Bayesian 
approach. The information matrix, which is positive definite for the 
class of systems analyzed, gives a measure of performance for the sy- 
stem inputs. The computation of the optimal closed- loop input mappings 
is shown to be a nontrivial exercise in adaptive control. Deterministic 
optimal inputs are shown to be easily computable. Numerical examples 
are given. A Kalman filter is used to estimate the parameters. A 
necessary condition for the Kalman filter not to diverge when applying 
linear feedback, is also given. 
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INTRODUCTION 


The problem of designing deterministic system inputs to en- 
hance the performance of some identification algorithms has received 
some attention in the literature. The formulations are based on the 
Fisher Information Matrix and on least squares identification. Continuous- 
time systems have been discussed by Levadi [1], Nahi et al [2,3] and 
Mehra [4] . Deterministic systems observed through additive Gaussian 
noise are considered in those works. Discrete-time single-input single- 
output systems are considered by Gagliardi [5] , Aoki and Staley [6] , 

Goodwin et al [7] and more recently by Mehra [8] whose frequency-domain 
approach allows multi-output systems. 

The more general results have been obtained for formulations 
that lead to a linear model [1] , [8] , where linear least-squares 
techniques can be applied. The Fisher Information approach usually 
yields equations that depend on the unknown parameters [2, 3, 4, 6, 7] and 
thus some suboptimal scheme has to be used. 

In this paper, we shall be concerned with a particular class of 

linear systems , namely those whose unknown random parameters appear 

linearly in the system description. Using a Bayesian approach, we 

will attempt to design optimal closed-loop inputs via the Information 

Matrix criterion. In Section II we indicate that this is a difficult 
problem in adaptive control. Section III gives the solution to the 
problem for deterministic inputs and some examples are given in 
Section IV. Finally, Section V gives an asymptotic result for iden- 
tification of closed-loop systems. 
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XI. OPTIMAL CLOSED-LOOP INPUT MAPPINGS 

In this section we will derive conditions that the optimal feed- 
back input mappings, maximizing the sensitivity of the system output 
with respect to the parameters, must satisfy. This sensitivity criterion 
is also shown to be equivalent to maximizing the trace of the Information 
Matrix (which is positive definite for the class of systems considered) . 
Consider the discrete-time linear system 1 

n m 

^(k) - J A. (k-i)y(k-i) + £ B_. (k-i)u(k-i) + £(k) (2.1) 

i=l 1 i=l 1 

E* S 

where ;^(k) £ R and u(k) t R are the observed system output and input, 

respectively, 9/k) e R r is a white Gaussian noise with known statistics 

(5, . is Kronecker's delta) E{0(k)} = 0; E{0 (k) 0 T ( j ) } = 0(k)6 ; 0(k) > 0 

K] — — — kj 

all k, and where A^ (k) e R and B^ (k) e R are partially known 
random matrices. 

It is assumed that the initial conditions y (0) , . . . ,£(-n+l) and 
u(-l) = . . . = u(-m+l) = £ are known. 

By an appropriate rearrangement of Eq. (2,1) as noisy observations 
on the unknown parameters , one can apply the results of Kalman filtering 
theory. Assume that all elements of the A. (k) and B. (k) matrices are 

"T. “1 

unknown random parameters varying according to a Gauss-Markov process 
of the following form. Let a."^ (k) and b.^ (k) be the j-th row of A, (k) 

i — i —l 

"^Although this system has a very particular structure, it can be used 
in a large class of realistic problems. See e.g. £H], [12], [13], 
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and respectively, the vector x(k) e Rnr +msr of unknown parameters 

at the k-step is formed by placing the rows of A. (k-i) one after another 

-a. 

in consecutive order j = l,...r and for i = l,.,.n, followed by the 
rows of B^(k-i) in the same ordering fashion, i.e. 

I (k) = [a, 1 (k-l]a 2 (H)...a 1 (k-n)...a n (k-n)b. 1 (k-1) . . .b m (k-m)] T 
j. i ~ n — n —1 m 


then assumed to vary according to 


X(k+1) = £(k)x(k) + £(k) 


( 2 . 2 ) 


where F^(k) is known and X(^) and £(k) are mutually independent (also 
independent of 0(j)) Gaussian variables with known statistics, 

e{x(0) } - x , e{(x-X> (X‘X) T > = Lf 

E{£(k)} = o , E{£(k)£ T (j)} = 5(k)6 


With the above definitions, Eq. (2.1) can be rewritten as the 
linear model 


£(k) = C T (k)X(k) + _0 (k) = h (k) + £(k) 


(2.3) 


, rW „ _(nr +msr)xr . 

where C^(k) e R is given by 


T 

C (k) = 


X (k-1) 0 

0 X T (k-l) 


0 x T (k-2). • • *y T (k-n) . .0 u T (k-l). 


0 . . . *x (k-1) 0 


0...y (k-n) 0 . . 


u (k-m) ... 0 


u T (k-m) 
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The following lemma is merely a statement of the Kalman filter 
and will be used in the proof of Theorem 2.1 below. Its proof requires 
a modification of Kalman's original proof to take into account that C(k) 
is random. A formal proof can be obtained paralleling Striebel's [9] 
derivation of the conditional distribution for a class of systems. 


Lemma 2 . 1 

Let y^ = {^(1) , . . . ,^(k) } be the observed outputs of system (2.1) 
and assume that the inputs are of the feedback type (nonanticipative) 
u(k) = u(k,y ). Then the a posteriori density p(]f(h) |y ) is Gaussian 

K K 

with mean and covariance given respectively by 

£(k) = F(k-l)£(k-l) + Z(k|k)C(k)0“ 1 (k) (y(k)-C T (k)F(k-l)£(k-l)) 

(2.4) 


and 

Jc | k) = <£ -1 (k|k-l) + C(k)0'' 1 (k)C T (k))“ 1 (2.5) 

where Z(kjk-l) = F(k-l) Z_{k~l|k-1) F T (k-l) + H(k-l) and 

Z(0|0) = > 0, £(0) - £ • 

If some of the parameters are known, one can write 
X.00 “ £(k) + £^(k)x^(h) + £(k) , where X-^(k) is the vector of 

unknown parameters. Once again Striebel's procedure can be applied 
to derive pfX^M jy^). The result is to substitute £(k) by x(k)-£(k) 

and Cjk) by £^(k) in the above equations. 

Equations (2.4) and (2.5) provide a recursive estimator for 

A 

the unknown parameters, which is known to be unbiased (E{x(k) } = E^OO}) 


and minimizes quadratic estimation errors. Moreover, if the parameters 
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are mutually independent or are simple unknown constants, Eqs. (2.4) and 
(2.5) can be broken down to yield r Kalman filters of smaller dimension 
[ 10 ]. 

We now introduce some more definitions in order to state the 
. NsxNs 

mam theorem of this section. Let 2 e R be the diagonal matrix. 

s s ^ 

Q - Diag[(q 1 +. . .+q ]n )I^ , (q 2 +. . .+q n+1 )JT ' " * (q N-m + * * * +q n-l 5 - ' 

(q N-itH-l + ‘ * * +q N ) — S ' * * * ' (q N-l +q N } i S ' q N^ 5 } 1 Wh6re q k = tr - _1 00 and 
X s is the s x s identity matrix. Similarly, let P_ £ R^ N D rx ( N D r 

r 

be the diagonal matrix £ = Diag[ (q 2 +. . .tq^^) .1 ' * * ‘ ' ^N-n + ‘ ' * +q N-l^— ' 
(q N _ n+1 +. . .+q n )£ r ,. . . (q^j^l+qjj) I 17 , q^*] and define the submatrices 
sxs 

2^ £ R and £ R in correspondence with 2 and X as follows 
2 = Diagl^, . . . 

R. = Oiag[Pj, . . . t = 0 

Theorem 2.1 

Let ]£(k+l) = x<k) " X X(°^ a Gaussian variable with 

mean x and covariance > 0 and let be the Information Matrix 

% - U Wy log P(Y N ^I)) T (V y log p(^,X>)^dx ( 2 . 6 ) 

T T T 

where V denotes gradient and Y = [y (l)...y (n) ] . 

Then, for system (2.1) we have that 

% i V 1 ’ 


(i) > £ 


(2.7) 
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(ii) 


The following problems are equivalent (maximization is with 
respect to input mappings u(k,Y^)) 

(a) Max tr (2.8) 

(b) Max of the sensitivity index 


fN-1 r / 8h, (k)\ T /3h. (k)\ 

E |Jo l P ^ ,k ’ (-5T-) (iT 


(2.9) 


where p^(k) = [0 is the i - th diagonal element 

of © ^ (k) and h^ is the i*-th element of li (Eq. 2.3). 


(c) Max 


N-l 


l (£ WP^fk) + uNkJg^ufk))! 


( 2 . 10 ) 


Proof. See Appendix A. 


Remarks . 

It is known that is related to the error covariance matrix 
of any unbiased estimator of £ by the Cramer-Rao inequality 


— N ~ -^J ~ — 


-1 


( 2 . 11 ) 


Eq* (2*7) indicates that has only nonzero eigenvalues and since 
1/tr and tr go to zero at the rate 0(1/A min) as A^^ + » [6] , 

where A^^ is the minimum eigenvalue of M^ # Eqs* (2*8) and (2*11) give 

a relationship of the optimal inputs with mean square estimation errors. 
Eqs. (2*9) and (2.3) show that the inputs should maximize the sensitivity 
of the system output with respect to the parameters, where p^(k) has the 
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interpretation that if the noise is "high", the output contains "little 
information" about the parameters. Eq. (2.10) is derived from either 
(2.8) or (2.9) and remains unchanged if one considers time-varying 
Markov parameters (Eq. (2.2)) with the index (2.9) changed to 


N— 1 r 

l l Pi 00 


k=0 i=l 


( 3h i <k)1 

T 

fah.uon 

1 9X< k >j 


l ai(k)j^ 


( 2 . 11 ) 


Eq. (2.10) implies that this type of inputs for identification is con- 
flicting with any input aimed to regulate the system. 


III. OPTIMAL OPEN-LOOP INPUTS . 

It is not difficult to check that; unfortunately, the maximi- 
zation of Eq. (2. 10) , subject to an energy constraint, cannot be per- 
formed analytically. Some suboptimal schemes, developed in the theory 
of adaptive controllers may however be applicable, in this section, 
we give the solution to the problem for the case of deterministic 
inputs. We will consider only the case of constant random parameters, 
the extension to time- varying parameters with index (2.11) is completely 
straightforward. 


Let us define 

A C *<"- 

1) rx(N-l) r 

and 

£e r (H- 1)«(*-1). by 

-I. 

. . o" 



. . 0 




—2 

* 

• 

t t « 



m 

• 

• * 

• • 

• 

m 

m 

j 

B « 

• 

m 

■ ■ 

A 

— n . 

* 


B 

-in 

• * 

• 



* 

• « • 

♦ 



• 

• • • 

0 • f A • • m 

—n 

1 

V 

<\ 


0 . 

» » B * » m B_ B 

— m -2 — 1 
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Let h £ R^ N be a vector depending on the initial conditions through 




f n It i 

f n \ 

h = 



I A.Z(2-i) 



^i=l / ' 

[i=2 i 


(A£<0)) v q . . . o 7 

(N-n-l)r zeros 


(3*2) 


NsxNs 

Let R e R be the symmetric positive definite matrix 


H « £ + E< 


T -T -1 
BA P A B 0 


(3.3) 


—T —X T Ns 

where A = (_A ) always exists since det A = 1, and let d £ R 


d = E{ [ (bV T P a' 1 !!) T 0 . . . 0] T } 


(3.4) 


v 
s zeros 


Then we can state the following theorem. 

Theorem 3.1 

T T T Ns 

Let - [u. (0) . . . (N-l) ] £ R be the vector of the first 

N system inputs. The optimal input sequence u* (0) , . . . ,u* (N-l) that 
solves part (ii) of Theorem 2.1, subject to the energy constraint 

N-l m 

r T 

L ii (i)u(i) 5. W, is given by the vector 
i=0 

IJy* = [~H + X I] -1 d if d ? 0 (3.5) 

where is the Ns x Ns identity matrix and X is such that 

<V )T( 2 m * ) - w (3.6) 


If 1 then is the eigenvector corresponding to the 

maximum eigenvalue of H, normalized according to Eq. (3.6) • 

Proof See Appendix B, 
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Remarks . 

In Eq. (3. 5) X is a Lagrange multiplier* If the initial conditions are 
zero, or e{b} ** 0 and B. is independent of A (element-wise) then <3 - Ok 
These conditions are however not necessary for d to be zero* From 
Eqs. (3.1) through (3.4) it follows that and d depend on the noise 
statistics, on the initial conditions and on E{y^ }, k - l,...,2N-4, 
where y i is the i-th unknown parameter. The only difficulty in the 
computation of explicit expressions for H and d is that of notation. 

As an example, the single-input single-output (SISO) case is considered 
next. 


IV. EXAMPLES 


Example 1 

The SISO system is described by the equation 


n m 

y(k) = l a y(k-i) + £ b u(k-i) + 0(k) (4.1) 

i=l i-1 


Proposition 

T 

If X = I a i* * a n ^i* a vector of Gaussian mutually inde- 


pendent random variables , the expectations required in H and d can be 


obtained as follows 
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e{b 


m m 

:bV T P A^b} = £ E{b }E{b.>0 + £ E{b 2 }£ 

i,j=l J 3 i=l 


i/j 


2N-4 


+ ][ £ E{a 1 l}*..E{a n} 


k~l k, + . • .+k “k 
1 n 

k el 
a 


m 

l E{b i }E{b i } £iik 

i,j=i 1 3 X3K 


L i*) 


m 0 

+ l E{b.}R, j 
x -ilk 


i=l 


e{bV T P A -1 b} = l l I E{a.}E{b }y H f 


n m i 


i=l j*=l £=1 * J ~ ^ 

2N-4 m i 

h l l l l 

k=l j=l k +. . .+ (k.+l) +. ,+k =k+l A=1 
l . . x n 

l-l , . . . ,n 

k ei 

a 


E{ a;L k l}. . .E{a ± 1+1 }. . -E{a n k n}E{b ; . 


where k^ £ I reads "for all k^ £ {0,1 a = l,...n" and the matrices 

— ijk'^ij £ R ^ N and Eg,kj e rN 1 can be precomputed for 

fixed N(they depend only on the noise statistics 0{k) , k = 2,...,N, 
and if 0{k) = 0 all k, they can be replaced by constant matrices 
independent of 0) . 

If X a ve ct°r of jointly Gaussian variables, with mean ]£ 
and covariance Z, the transformation y^ = where £_ satisfies 

‘J 1 ijl 

i£ Y £ = Diagonal Matrix and £T = i r yields a set of mutually 
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independent Gaussian variables. The expressions for the expectations 
are then similar to those above. 

Proof See Appendix C. 


Example 2 (Numerical example) . 

Consider the two-input one-output system 
y(k) = ay(k-l) + b^u^fk-l) + b 2 u 2 (k-l) + 0(k) where 0 (k) is white 


Gaussian N(0,1). The following two cases of random parameters are 
analyzed. 

Case (i) a,^ and b 2 are jointly Gaussian with 



The inputs, computed according to Section III are (N=10) shown in Pig. 4.1. 
Case (ii) afb^ and b^ are jointly Gaussian with 



i.l 0 0 t \ 

0 .1 .1 
i 0 .1 1 IJ 


(4.3) 


The optimal inputs for this case are shown in Pig. 4.2. 

The inputs of Fig. 4.3 with the same energy (W = 85.23) that 
those of Figs. 4.1 and 4.2 were used as comparison. 

Mean square errors for 2500 Monte Carlo runs were computed, 
the results are shown in Table 4.1. 



* 

* 


Table 4.1 Mean Square Errors for Different Inputs 

From the figures we can see that u^* contains most of the energy 
of the optimal inputs. This agrees with our intuition, since b 2 , the 
parameter multiplying u 2 has the largest uncertainty. The results of 
the simulation (Table 4.1) indicate that the mean square estimation 
errors corresponding to the optimal inputs (indicated by *) are approx- 
imately only 50% of those corresponding to constant inputs. 

The optimal input for system (4.2) , Fig. 4.1, is of the impulse 
type. An interesting result follows by comparing the optimal inputs 
for system (4.3) with the impulse type input Fig. 4.1 (3th vs. last 
row in Table 4.1) . The former gives a 10 % reduction on mean-square 
estimation errors corresponding to the latter. Thus impulse type 
inputs are not always the best. 

V. ASYMPTOTIC ANALYSIS 

Most identification algorithms have beer* designed on the basis 
of an open-loop operation of the plant and erroneous results may be 
obtained if one uses such algorithms for closed-loop system identification 
[14] [15]. Nevertheless, if one is careful about the type of feedback 
being used, then correct results are obtained. 


System 

Input 

tr Iio 

(4.2) 

Fig. 4.1 

.1281 

Fig. 4.3 

.2510 

(4.3) 

Fig. 4.2 

.1163 

Fig. 4.3 

.2062 

Fig. 4.1 

.1287 
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Consider for instance the system [16] 

x(k) + ax(k-l) = bu(k-l) + 0(k) 

u(k) = ax(k) (5.1) 

It is well known that the Kalman filter on least squares estimate can 
be obtained minimizing 

N 2 _ _ , a-a \ 

J(a r b ) I (x(k+l) + ax(k) - bu(k)) + (a-a b-b)V 7 (5.2) 

k=l -*> \b-b / 

where a, b ' -ab is the a priori information. As N 00 the a priori 

information becomes negligible. Thus Eq. (5.2) becomes 
N 

J(a,b) “ £ (x(k+l) + ax(k) - bu(k))^ 

k=l 

N 

" l (x(k+l) + (a+aX)x(k) - (b+X)u(k )) 2 
k=l 

= J(a + aX, b + X) (5. 3) 2 

and the parameters are not identifiable. They become identifiable if 
the feedback loop is changed to u(k) = o^xfk) + a 2 x(k-l). 

The following theorem gives a necessary condition for the 
Kalman filter (Eqs. (2. 4), (2. 5)) not to diverge when linear feedback 
is being used. Only the SISO case is considered. The extension to 
multi-input multi-output systems is conceptually straightforward. 


Note that this cost function can also correspond to a maximum 
likelihood estimate. 
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Theorem 5.1 

Let the system be 

n m 

y(k) = l a^fk-i) + J b ± u{k-i) + 0(k) (5.4) 

i=l i=l 

2 T 

where 0(k) is white Gaussian N(0,cr )• Let Y ~ [a- • • • a b ..,b 1 

^ 1 n 1 m 

be a Gaussian random vector N(y,Z ) , and assume that linear feedback 

y 

A 

u(k) - l a y(k-i) (5.5) 

i=0 

is being used. Then, if A < n we have that the mean square errors 
satisfy 

lim e{(Y - X<N)) T (X - Y(N))} + 00 (5.6) 

N-x” 

Proof . See Appendix D. 

VI. CONCLUSION 

The design of inputs for system identification has been con- 
sidered using a Bayesian approach. The optimal identifying inputs are 
shown to be conflicting with a system regulator. Optimal deterministic 
inputs have been shown to be easily computable and digital simulation 
shows that they yield smaller mean square estimation errors than arbi- 
trary inputs. The divergence of the Kalman filter— estimator for 
closed- loop systems has been analyzed. 



APPENDIX A 


From the definition of the Information Matrix given by Eq. (2.6) , 
= E{(V y log P(Y n /X>) T (V y log pO^Y))} (A.D 

Using Baye's rule we get 

= e{(V y log + V Y log pO^)) 1 ^ log pfX)^) 

+ V y log p (Y^ ) } 

i.e. = E{(V y log p(YiV) T (V Y log P ( XlV )} 

From the results of Lemma 2.1, the density pCxIX^) for an Y set 
admissible (nonanticipative) input mappings u(k,Y k ) is Gaussian with 
mean and covariance given by ;y(N) and £(n|n), thus 

log ptxl]^) = - ( X “ X (n >> T I~ 1 (n|n) 

and using the properties of conditional expectations, can be written 
as 

^ - e{e{Z~ 1 (n|n) (y - £(n)) (X ~ Y( N >) T i~ 1 ( N i N > 

or 

= e{^ _1 (n1n)} (A. 2) 

Now, from Eq. (2.5) of the lemma 

£ -1 (k|k) = _E _1 (k-l|k-l) + C(k)0 _1 (k)C T (k) 

then 

-1 , N 

Z (n|n) = l~ + l C(k)0 -1 (k)C (k) (A. 3) 

T k=l 

Substitution of (A. 3) into (A. 2) proves part (i) of the Theorem. 


- 15 - 
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Maximization of tr is, by (A. 2) and (A. 3) equivalent to 
maximization of 


N 


J x - tr E-j l C T (k)C(k)0" 1 (k) 


(A. 4) 


_k=l 

Using the definitions of C^(k) , li (k) and (k) , it follows after some 
algebra that Eqs. (2.9) and (A. 4) are equivalent. This proves parts 
(ii-a) and (ii-b) of the theorem. 

We also have that 

T / 5 T 5? m 

C (k)C(k) = X X (k~i) y(k-i) + l u T (k-i)u(k-i) 

\ i=l i=l 

where I is the r x r identity matrix. Thus, using the definition of 

\ we get 

tr[C T (k)C(k)0 _1 (k)] « q k . 


11 T„ , ? T, 


X £ (k-i)^(k-i) + l u T (k-i)u(k-i) 
i-1 i-1 


Adding N of these terms as indicated by (A* 4) and using the definitions 
of P and 2. have 


J i - 4-i + a V 

where = [£ T (1) . . ,£ T <N-1) ] , U^ = [u T {0) . . .u T (N-l) ] . 

Using the partitions of and £, from (A. 5) we get 
N-l 

J = E- l (y <k)P k y(k) + u T (k)^u(k)) 

This completes the proof. 


(A. 5) 



APPENDIX B 


For deterministic inputs, (A. 5) can be written as 

J i * tr £ + sfi 


or 


where 


J i - tr £ - V ♦ %' r 2 a 


— y —N -*N 


(B.l) 


Vi - E{ Vi^> 

4 = E{l Vi-a-i 1, Vi-v/'i 1 

It is easy to see that the first N-l system outputs satisfy 

**H-l + £aHl + ia«-l + a-0 (B.2) 

where A, B and h are defined in the Theorem, I e R ( N_1 ) rx(N-l) r ig 


the identity matrix and 


= [e T (i)....e T {N-i)3 T 

Since A 1 always exists, the vector 

Vi - - i : l <£ Vi + Vi + y 

has a Gaussian conditional (on yj density , with 

Vi ’ - £ 1 Vi * y 

E = A _1 0 a' T 


where 0 = Diag (0(1) ,. . . ,0 (n-1) ) . 

Therefore, substituting into (B.l) and considering only terms 
that depend on U, we must maximize 


- 17 - 
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v - Vi T ^V T £ + A~\} + Hn 


Using H and d as defined in the Theorem we have 


T T 

J’«U HU +2U d 

1 —N — — N ^ - 


To maximize J^' subject to the energy constraint 


2» 


notice that since is strictly convex, the optimum occurs when 
T 

U U, = W, the optimization is thus equivalent to 
—N — N 


Min -U T H U., - 2U 1 T d + X<u T U - W) 
— N — — N -W — — N — N 


where X is a Lagrange multiplier. The necessary condition gives 

H V* + d - XU * = 0 (B. 3) 

— — N — — N 

If d_ ? £, u^* = [-£ + XI] ^d, which proves the first part of the Theorem. 

If d * 0, then (B.3) gives H U * = Xu *, i.e. U* is an eigenvector of H. 
— — N — N — N — 

Ns 

Since for all x £ R 

X . (H)x T x < (J. ' = x T H x) < X (H)x T x 
min — — 1 — — — — max — 

the cost attains its maximum when U^* is the eigenvector corresponding 
to the maximum eigenvalue of H. This completes the proof. 



APPENDIX C 


_ . -1 (N~l) x(N-l) ^ , . 

The inverse A e R can be obtained via 


A -1 = - (I + G + ... + G* 1 " 2 ) 


where 


£- l M 

1=1 


where 


£ R^ N 1)x(N 1) shift matrix with i-jth element 




Since G = Y a. ...a. I. ...X, 

" i x 0^=1 X 1 V^l "S 


we have that 

.-T„ -1 


-T -1 r T 

A P A = P + > a. (P I. + I. P) 

“ i 1, i. “I. - 


i^l *1 1 1 


n 


+ I a. a, {P I. I. + I. p i. + I. T T ) 

i, ,i„-l l l X 2 - ~ ~ 1 2 "2 h. £ 


12 


+ « « I 


n 


r T T 

2 a. ...a. U. ...I. P 1^ ..I. 

i l'** ,i 2N-5 =1 1 1;2N “ 5 1n " 3 11 N-2 ”^ L 2N-5 


T T 

+ I. ...I, P I. ...I, ) 

N-2 1 N-l 2N-5 


n 

V T T 

+ 2 a i ...a. (1^ ...I. PI. ..I. 

± 1'** ,iL 2N-4 =1 1 2N-4 N " 2 N-l "^2N-< 
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or 


-T -1 
A P A 


2N-4 

I + l 


k=l i , . . , i^=l 1 


n 


where is defined by correspondence. 


m 


Expressing B as B — J b.I. , we have that 
— — . _ X“l~l 

1=1 


T -T -1 5J 2N-4 n 

® h 1 1 2. " I (b i b i2ii + I la... 

i,j»l J J k=l 1 


% b i b Ajk> < ca > 


where 


2ij ’ I -~- F 1 


.C i. . _ 

-1-1— -g-l 


R, . = I. ,N. I . , 

— i]k — i-l— k-g-1 


The vector h can also be written as 


n i 

b= l I Vt-A-1 

i=l A-l 1 * ^ 1 


therefore 


1 1 .1 ^ A . 


n m 1 


i=l j=l A=1 




2N-4 


n m 1 


+ 1 III a. a. ..a b y ^ (C.2) 

k=l i,i 1 ,..,i k =l j=l £=1 1 x i x k D * 


where 



- 21 - 



The expected values of (C.l) and (C.2) will lead to the ex- 
pressions stated in the Proposition. If the vector X has correlated 

T 

variables, it can be substituted in (C.l) and (C.2) by X => where 

X x has independent variables. The resulting expectations will there- 
fore be similar, in terms of expected values of X^* This completes 


the proof. 



APPENDIX D 


The conditional error covariance matrix Eq. (2.5) corresponding 
to the system (5.4) is 



where £(k) * [y(k-l) .. 
also be written as 


(N/N) =* E " 1 + - 
' CT 

. y(k-n)u(k-l) ... 


m 

l c(k)c (k) (D.l) 

k=l 


u(k-m)J . Equation (5.5) can 


U(N) = F(N)Y(N) (D.2) 

where U(N) = [u(0) ... u(N)] T , Y(N) = [y(0) . . .y (N) ] T and 

A 

F(N) = l a.X, (D. 3) 

i=0 

where JL is the (N+l)x(N+l) shift matrix defined in a previous appendix. 

N+l 

Let < £,Y > denote the usual inner product in R . Thus using 
(D.2) in (D.l) we have that as N 4 


if 1 (N/N) 


<Y(N) ,Y(&) > <Y(N) fljYtN) >. ..<Y(N) /^^(N) > <Y(N) ,F(N)Y(N)>. 

♦ 

■ 

<1 _Y(N),Y(N)> 

~n~l~ — t 

<F(N) Y(N) ,Y(N) > 




\f (Vt\ ' 
± \L\ f < 


. <1 F(«)Y(N) ,F(N) Y(N) >. . 

~m—x — — ■ — 
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<Y(N) ,1 . F(N)Y(N)> 

— -m-l— — 


- 2 a.s. (D.4) 


<1 ,F(N)Y(N),I ,F(N)Y(N)> 

-m-l— — -m-l— — 


To check positive definiteness one can use Silvester's rule. By the 
properties of determinants and Eq. (D.3) it follows that if t < n the 
matrix (D.4) is singular. This implies that as K + ®, at least one 
eigenvalue of I(N/N) will go to infinity a.s. and tr = S(N/N) will not 

be integrable. Thus lim e{ (Y-y(N) ) T (y-Y(N> ) } = lira E{tr 2 (n/n) } + «. 

N‘ +0 ° ~ “ m+oo 
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01 23456789 10 


0 1 2345 6789 10 


Fig. 4.2 Optimal Inputs for System (4.3) 



01234 56789 10 01 23456789 10 


Fig. 4.3 Constant Inputs 
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